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Abstract 

Finite-turn pushdown automata (PDA) are investigated concerning their descrip- 
tional complexity. It is known that they accept exactly the class of ultralinear context- 
free languages. Furthermore, the increase in size when converting arbitrary PDAs ac- 
cepting ultralinear languages to finite-turn PDAs cannot be bounded by any recursive 
function. The latter phenomenon is known as non-recursive trade-off. In this paper, 
finite-turn PDAs accepting bounded languages are considered. First, letter-bounded 
languages are studied. We prove that in this case the non-recursive trade-off is reduced 
to a recursive trade-off, more precisely, to an exponential trade-off. A conversion al- 
gorithm is presented and the optimality of the construction is shown by proving tight 
lower bounds. Furthermore, the question of reducing the number of turns of a given 
finite-turn PDA is studied. Again, a conversion algorithm is provided which shows 
that in this case the trade-off is at most polynomial. Finally, the more general case 
of word-bounded languages is investigated. We show how the results obtained for 
letter-bounded languages can be extended to word-bounded languages. 

Key words: automata and formal languages, descriptional complexity, finite-turn push- 
down automata recursive trade-offs, bounded languages 



1 Introduction 



Finite-turn pushdown automata (PDAs) were introduced in [5] by Ginsburg and Spanier. 
They are defined by fixing a constant bound on the number of switches between push and 
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pop operations in accepting computation paths of PDAs. The class of languages defined 
by these models is called the class of ultralinear languages and is a proper subclass of the 
class of context-free languages. It can be also characterized in terms of ultralinear and 
non-terminal bounded grammars [5]. (In the special case of 1-turn PDAs, i.e., devices 
making at most one switch between push and pop operations, we get the class of linear 
context-free languages) . 

In [12], descriptional complexity questions concerning finite-turn PDAs were investigated, 
by showing, among other results, the existence of non-recursive trade-offs between PDAs 
and finite-turn PDAs. Roughly speaking, this means that for any recursive function /(n) 
and for arbitrarily large integers n, there exists a PDA of size n accepting an ultralinear 
language such that any equivalent finite-turn PDA must have at least f{n) states. Thus, 
a PDA with arbitrary many turns may represent an ultralinear language more succinctly 
than any finite-turn PDA and the savings in size cannot be bounded by any recursive 
function. 

This phenomenon of non-recursive trade-offs was first observed between context-free gram- 
mars and deterministic finite automata (DFAs) in the fundamental paper by Meyer and 
Fischer [13j . Nowadays, many non-recursive trade-offs are known which are summarized, 
e.g., in [2] and [Tl]. In the context of context-free languages non-recursive trade-offs are 
known to exist between PDAs and deterministic PDAs (DPDAs), between unambiguous 
PDAs (UPDAs) and DPDAs, and between PDAs and UPDAs. Recursive trade-offs are 
known, e.g., between nondeterministic/alternating finite automata and DFAs and between 
DPDAs and DFAs. 

Interestingly, the witness languages used in |13j were defined over an alphabet of two 
symbols and leave open the unary case which was recently solved in [T3] by proving an 
exponential trade-off. Thus, the non-recursive trade-off in the binary case turns into 
a recursive trade-off in the unary case. More generally, a careful investigation of the 
known cases of non-recursive trade-offs reveals that the used witness languages are not 
bounded resp. word-bounded, i.e., they are not included in some subset of wlw2 ■ ■ ■ for 
some fixed words wi,W2,---, Wm- So, the question arises whether the above non-recursive 
trade-offs can be translated to the bounded case or whether the structural limitation on 
boundedness is one that will allow only recursive trade-offs. 

In this paper, we tackle this question and restrict ourselves initially to the case of letter- 
bounded languages, namely, subsets of al . . . a^, where ai, . . . , are pairwise distinct 
symbols. Our main result shows that for these languages the trade-off between PDAs 
(or context-free grammars) and finite-turn PDAs becomes recursive. More precisely, in 
Section [3] we first show that each context-free grammar in Chomsky normal form with 
h variables generating a letter-bounded set can be converted to an equivalent finite-turn 
PDA whose size is 2^^^\ Furthermore, the resulting PDA makes at most m — 1 turns 
where m is the number of letters in the terminal alphabet. In a second step, an exponential 
trade-off is also shown for arbitrary context-free grammars. 

We prove in Section [5] that this result is tight by showing that the size of the resulting PDA 
and the number of turns cannot be reduced. Note that this result is a generalization of 
the above-mentioned transformation of unary context-free grammars into finite automata 
which is presented in In Section [4] the investigation is further deepened by studying 
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how to reduce the number of turns in a PDA. In particular, given a fc-turn PDA accepting 
a subset of a^a^ ■ ■ ■ aj^, where A; > m — 1, we show how to build an equivalent (m — l)-turn 
PDA. It turns out that in this case the trade-off is polynomial. This result is also used 
to prove the optimality of our simulation of PDAs accepting letter-bounded languages by 
finite-turn PDAs. Finally, in Section [H we consider word-bounded languages. Based on 
the constructions for letter-bounded languages in the previous sections, we are able to give 
similar constructions for word-bounded languages. Thus, similar upper and lower bounds 
can be obtained for the general situation of word-bounded languages. 

We would like to remark that bounded context-free languages have very appealing proper- 
ties concerning their decidability questions. It is known |3| that equivalence and inclusion 
problems are decidable whereas both problems are undecidable for context-free languages 
and inclusion is an undecidable problem for deterministic context-free languages. Further- 
more, it is decidable whether a given context-free grammar generates a bounded language. 
In the positive case, the words wi,W2,---, Wm can be effectively calculated. For the mem- 
bership problem we know the Cocke- Younger-Kasami algorithm which solves the problem 
in cubic time. It is shown in [9] that letter-bounded context-free languages can be accepted 
by a certain massively parallel computational model. This result implies that the mem- 
bership problem for letter-bounded context-free languages can be solved in quadratic time 
and linear space. Since the membership problem for word-bounded context-free languages 
can be reduced to the membership problem for letter-bounded context-free languages by 
using suitable inverse homomorphisms, we obtain identical time and space bounds also in 
the word-bounded case. 



2 Preliminaries and Definitions 

Given a set S, #5 denotes its cardinality. Let S* denote the set of all words over the finite 
alphabet S, with the empty string denoted by e, and 11+ = S*\{e}. Given a string x E S*, 
|x| denotes its length. For the sake of simplicity, we will consider languages without the 
empty word e. However, our results can be easily extended to languages containing e. Let 
REG denote the family of regular languages. We assume that the reader is familiar with 
the common notions of formal language theory as presented in [8] . 

A context-free grammar (CFG, for short), is a 4-tuple G = {V,T,, P, S), where V is the 
set of variables, S is the set of terminals, V and S are disjoint sets, S" G ^ is the initial 
symbol and P Q V x (V U S)* is the finite set of productions. A production {A, a) € P 
is denoted by A — > a. The relations =^, and =^ are defined in the usual way. Given 
a,P € (y U S)*, if 6* is a derivation of P from a, then we write 9 : a ^ p. A useful 
representation of derivations of context-free grammars can be obtained using parse trees. 

A parse tree (or tree, for short) for a context-free grammar G is a labeled tree satisfying 
the following conditions: 

(1) Each internal node is labeled by a variable in V. 

(2) Each leaf is labeled by either a variable, a terminal, or e. However, if the leaf is 
labeled e, then it must be the only child of its parent. 
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(3) If an internal node is labeled with a variable A, and its children, from left to right, 
are labeled with Xi, X2, ■ ■ ■ , Xj. G y U S, then A ^ X1X2 ■ ■ ■ Xj. is a production of 
G. 

If r is a parse tree whose root is labeled with a variable A £ V and such that the labels 
of the leaves, from left to right, form a string a £ {V L) S)*, then we write T : A ^ a. 
Furthermore, we indicate as i^(T) the set of variables which appear as labels of some nodes 
in T. 

The language generated by the grammar G, i.e., the set {x € T.* \ S ^ x}, is denoted by 
L{G). 

The class of languages generated by CFGs is called the class of context-free languages. It 
is well-known that the class of context-free languages properly contains the class of regular 
languages (i.e., the languages accepted by finite automata), but in the unary case, these 
two classes collapse [4]. 

A grammar G = {V,T,, P, S) is said to be in Chomsky normal form if and only if its 
productions have the form A BG or the form A ^ a, with A,B,C G V and a G S. 
It is well-known that each context-free language not containing the empty word can be 
generated by a context-free grammar in Chomsky normal form. 

A production yl ^ a of a context-free grammar G = {V, S, S, P) is said to be right-linear 
(left-linear, linear) if a G S*y (a G VT.*, a G T.*VT.*). 

A context-free grammar G = (V,!^, S, P) is said to be ultralinear [5] if y is a union of 
pairwise disjoint (possibly empty) subsets Vq, . . . ,Vn of V with the following property. For 
each Vi and each A € Vi, each production with left side A is of the form A ^ w, where 
either w G T,*Vi'E* or G (S U Vq U . . . U Vi-i)*. The family of languages generated by 
ultralinear grammars is called ultralinear languages and denoted by ULTRALIN. 

A context-free grammar G = {V,Y^, S, P) is said to be non-terminal bounded [5] if there 
exists an integer k with the following property: li A ^ w, w (z {V L) S)*, A V, then w 
has at most k occurrences of variables. The rank rdw) of a word w G (VUT,)* is defined 
to be the largest integer r such that there is a word u G {V UT,)*, with r occurrences of 
variables, such that w ^ u. It is known [5] that a context-free grammar G is ultralinear 
if and only if G is non-terminal bounded. 

For each ultralinear grammar G, the rank of G is defined as the largest integer which is 
the rank of one of the variables. Let L be an ultralinear language. The rank of L, r(L), 
is defined as zero, if L is regular. If L is nonregular, then r{L) is defined as the smallest 
integer which is the rank of some ultralinear grammar generating it. 

Let M = {Q,'S,T,S,qo, Zq, F) be a pushdown automaton [8]. A configuration of a push- 
down automaton is a triple (g, w, 7) where q is the current state, w the unread part of the 
input, and 7 the current content of the pushdown store. The leftmost symbol of 7 is the 
topmost stack symbol. We write (g, aw, Z7) h (p, w, /?7), if 5{q, a, Z) 3 {p, (3) for p,q & Q, 
a G S U {e}, w G S*, 7,/3 G F*, and Z G F. The refiexive and transitive closure of h is 
denoted by h*. The language accepted by M with accepting states is 

r(M) = G S* I {qo, w, Zo) h* {q, e, 7) with g G F and 7 G F*}. 
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A sequence of configurations of M [qi^wi^^i) h . . . h {qk^Wk-,^k) is called one-turn \X\ if 
there exist 1 < i < j < k sucli that 

l7il < ••• < l7i-il < l7il < l7i+il < ••• < hj\ > hj+i\ > ■■■> hk\- 

A sequence of configurations cq H . . . h Cm is called k-turn if there are integers = 
io, . . . ,ii = 171 with I < k such that for j = 0, — 1 the subsequences Cj^. h . . . h q^^-^ 
are one-turn, respectively. M is a fc-turn pushdown automaton if every word w € T{M) 
is accepted by a sequence of configurations which is /c-turn. 

By C{k-turn PDA) we denote the family of languages accepted by A;-turn PDAs. The 
union of all A;-turn PDAs with fixed A; > 1 is the set of finite-turn PDAs. The family of 
languages accepted is defined as C{finite-turn PDA) = |Jfc>i C{k-turn PDA). 

Thus, /c-turn PDAs are allowed to make new turns not depending on the stack height. 
The following characterization of ultralinear languages by finite-turn PDAs may be found 
in [1] and respectively. 

Theorem 1 A language L belongs to ULTRALIN if and only if there is a k 'N such 
that L is accepted by a k-turn PDA. 

We want to consider in this paper PDAs in a certain normal form. Thus, we make, without 
loss of generality, the following assumptions about PDAs (cf. [S]). 

(1) At the start of the computation the pushdown store contains only the start symbol 
Zq; this symbol is never pushed or popped on the stack; 

(2) the input is accepted if and only if the automaton reaches a final state, the pushdown 
store only contains Zq and all the input has been scanned; 

(3) if the automaton moves the input head, then no operations are performed on the 
stack; 

(4) every push adds exactly one symbol on the stack. 

The transition function 5 of a PDA M then can be written as 

In particular, for q,p € Q, A, S € F, a G S, (p, — ) e 5{q,a,A) means that the PDA M, 
in the state q, with A at the top of the stack, by consuming the input a, can reach 
the state p without changing the stack contents. ip,pop) G 5{q,e,A) {{p,push{B)) G 
S{q, e, A), (p, —) G d{q, e, A), respectively) means that M, in the state g, with A at the top 
of the stack, without reading any input symbol, can reach the state p by popping off the 
stack the symbol A on the top (by pushing the symbol B on the top of the stack, without 
changing the stack, respectively). 

A descriptional system D is a recursive set of finite descriptors (e.g. automata or gram- 
mars) relating each A D to a language T{A). It is additionally required that each de- 
scriptor A & D can be effectively converted to a Turing machine such that T{Ma) = 
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T{A). The language family being described by D is T{D) = {T{A) \ A E Z)}. For every 
language L we define D{L) = {A ^ D \ T{A) = L). A complexity measure for D is a total, 
recursive, finite-to-one function | • | : D — > N such that the descriptors in D are recursively 
enumerable in order of increasing complexity. Comparing two descriptional systems Di 
and D2, we assume that T{Di) n T{D2) is not finite. 

We say that a function / : N — > N, f{n) > n is an upper bound for the blow-up in 
complexity when changing from one descriptional system Di to another system D2, if 
every description A Di of size n has an equivalent description A' € D2 of size at most 
fin). 

We say that a function : N — > N, g{n) > n is a lower bound for the trade-off between two 
descriptional systems Di and D2, if there is an infinite sequence iV C N and an infinite 
sequence {Ln)neN of pairwise distinct languages such that for all n G there is a 
description A ^ Di for L„ of size n and every description A' S -D2 for L„ is at least of size 

gin). 

According to the discussion in [6] the size of a PDA should be defined depending on the 
number of states, the number of stack symbols, the number of input symbols, and the 
maximum number of stack symbols appearing in the right hand side of transition rules. 
In this paper, we consider PDAs in the above defined normal form over a fixed alphabet 
S. Thus, size{M) of a PDA M in normal form is defined as the product of the number 
of states and the number of stack symbols. It can be observed that this measure fulfills 
the above defined conditions on descriptional measures. The size of a finite automaton is 
defined to be the number of states. 

As a measure for the size of a context-free grammar G = {V,Ti, P, S) we consider the 
number of symbols of G, defined as Symb(G) = J2(A^a)ep('^~'^\'^\) (*^^- ffS])- Furthermore, 
in the paper it will be useful also to consider the number of variables of G, defined as 
Var(G) = #y (note that this function in general is not a measure for the size). Some 
general information on descriptional complexity may be found in [2\. 

3 From Grammars to Finite-Turn PDAs 

In this section, we study the transformation of CFGs into finite-turn PDAs. Our main 
result shows that given a grammar G of size h, we can build an equivalent finite-turn PDA 
M of size 2^^^\ Furthermore, if the terminal alphabet of G contains m letters, then M is 
an (m — l)-turn PDA. The tightness of the bounds will be shown in Section [5l For the sake 
of simplicity, we start by considering CFGs in Chomsky normal form with the measure 
Var. At the end of the section, we will discuss the generalization to arbitrary context-free 
grammars, taking into consideration the size defined by the measure Symb. 

In the following we consider an alphabet S = {oi, . . . , am} and a CFG G = {V, S, P, S) 
in Chomsky normal form with h variables, generating a subset of . . . a^. Without loss 
of generality, we can suppose that each variable of G is useful, i.e., for each A £ V, there 
exist terminal strings u,v,w, such that S =^ uAw =^ uvw. 

Lemma 1 For each variable A £ V there exists an index I, 1 < I < m (r, 1 < r < m, 
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resp.) such that if A =^ uAv, with u,v (z S*, then u € (v € a*, resp.). Furthermore, if 
there exists at least one derivation A ^ uAv with u ^ e, (v ^ e, resp.) then such an I (r, 
resp.) is unique. 

Proof: It is easy to see that ii A^ uAv and u contains at least two letters a^/, a///, with 
I' 7^ then, because A ^ uuAvv, the language generated by G should contain a string 
not belonging to . . . a^. Hence, u € aj, for some 1 < I < m. 

Now suppose that u ^ e and A =^ u'Av'. Because A =^ uu'Av'v, using the previous 
argument it is easy to conclude that u' € a^. 

A similar argument can be given for the right part. □ 

For each variable A such that A =^ uAv, with u,v ^ S"*", we denote by border{A) the 
unique pair {l,r) of indices, given in Lemma [TJ On the other hand, if for any derivation 
A =^ uAv., the string v is empty, then we define border (A) as the pair (1,1), and if for any 
derivation A =^ uAv the string u is empty, then we define border {A) as the pair (r, r). If 
there are no derivations of the form A ^ uAv, then we leave border{A) undefined. 



Formally, 



border (A) = < 



(/, r) if ^ =^ uAv for some u € a^, w € 

(/, /) if u € af^ and v = e for any A =^ uAv 

(r, r) if n = e and w € for any A =^ uAv 

undefined otherwise. 



For the sake of brevity, border{A) will be denoted also as {Ia,^a), if defined. 

We now consider the relation < on the set of possible borders defined as {l,r) < {l',r') if 
and only if / < Z' and ii I = I' then r > r', for all {l,r), {l',r') € {1, . . . ,m}^, with I < r 
and /' < r'. It is not difficult to verify that < is a total order on the set of pairs of indices 
/, r from {1, . . . , m}, such that / < r. 

Actually, we are interested in computing borders of variables belonging to the same deriva- 
tion tree. In this case, either a variable is a descendant of the other in the tree, and then 
the interval defined by its border is inside the interval defined by the border of the other 
variable, or one variable is to the right of the other one, and then the corresponding 
interval is to the right of the other one. More formally, we can prove the following: 

Lemma 2 Let T be a derivation tree and A,B & ^{T) be two variables. If border [A) < 
border (B), then either: 

(a) Ia<Ib <rB < TA, or 

(b) Ia < Ib, ta < TB, and rA < h- 



Proof: The case border{A) = border{B) is trivial. Thus, for the rest of the proof we 
suppose that border{A) ^ border{B). 
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If A and B lie in T on the same path from the root, then A must be closer to the root 
than B (otherwise border (B) < border (A)). It is immediate to conclude that in this case 
Ia < Ib < rB < TA- 

Otherwise, A should appear in a path to the left of the path containing B. This easily 
implies that Ia 1^ < Ib 1^ 1"B- We consider the two subcases rA = tb and rA < tb- 

li rA = rB then we get Ia < Ib = i^b = rA, so (a) holds. On the other hand, if < 
then it is not possible that Ia = Is because this should imply, with border(A) < border(B), 
the contradiction < r^. Hence, Ia < Ib that, with < and < Ib, gives (b). □ 

A partial derivation tree (or partial tree, for short) U : A ^ vAx is a parse tree whose 
root is labeled with a variable A and all the leaves, with the exception of one whose label 
is the same variable A, are labeled with terminal symbols. 

Given a partial tree U : A ^ vAx, any derivation tree T : S ^ z with A E y{T) can be 
"pumped" using U , by replacing a node labeled A in T with the subtree U . In this way, 
a new tree T' : S" ^ z' is obtained, where z' = uvwxy, such that z = uwy, S uAy, and 
A^w. Moreover, u{T') = v{T) U v{U). 

On the other hand, any derivation tree producing a sufficiently long terminal string can 
be obtained by pumping a derivation tree of a shorter string with a partial tree. This 
fact, which is essentially the pumping lemma of context-free languages (see, e.g., [8]), is 
recalled in the following: 

Lemma 3 Let T : S ^ z be a derivation tree of a string z € S*. // \z\ > 2^^^ then we 
can write z = uvwxy, such that < \vx\ < 2^^, there is a tree T' : S ^ uwy, a variable 
A G z^(r') and a tree T" : A ^ vAx such that v{T) = v{T') U v{T") and A =^ where h 
is the number of the variables of the grammar G. 

Proof: First of all, we recall that given a parse tree of a string a according to a context- 
free grammar G in Chomsky normal form, if the longest path from the root to a leaf in 
the tree has length k (measured by the number of edges), then |a| < 2'^"^ (see, e.g. ,[8]). 
Using this property, we get that if \z\ > 2^~^ then the tree T : S ^ z must contain a path 
of /i + 1 edges from a node n to a leaf. Hence, this path must contain two nodes labeled 
with the same variable A. This defines the decomposition of T in T' and T" . Again by the 
above property, the terminal string vwx generated by the subtree rooted at n has length 
bounded by 2^ . Furthermore, > 1, because the grammar is in Chomsky normal form. 
Hence \vx\ <2^. □ 

By applying the pumping lemma several times, we can prove that any derivation tree can 
be obtained by starting from a derivation tree of a "short" string (namely, a string of 
length at most 2'^""'^) and iteratively pumping it with "small" partial trees. Furthermore, 
a sequence of partial trees can be considered such that the sequence of borders of their 
roots is not decreasing. This fact will be crucial to get the main simulation presented in 
this section. More precisely, we are able to prove the following result: 

Lemma 4 Given a derivation tree T : S ^ z of a string z G S*, with \z\ > 2^^^^ , for 
some integer k > there are: 
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• A; + 1 derivation trees Tq, Ti, . . . , Tk, where Ti : S ^ Zi, i = f), . . . ,k, < \zq\ <2^'' , 
Tk = T, Zk = z; 

• k partial trees Ui, . . . Uk, where, for i = 1, . . . ,k, Ui : Ai ^ viAiXi, < \viXi\ < 2^, 
and Ti is obtained by pumping Tj_i with Ui. 

Furthermore, border (Ai) < border {A2) < • • • < border{Ak)- 

Proof: We can build the sequence from the end starting from T and decomposing it in 
a tree T' and a partial tree U, according to Lemma [3l This process can be iterated until 
a derivation tree Tq producing a string of length bounded by 2^~^ is obtained. 

In order to get a sequence of partial trees such that the sequence of the borders of the 
variables labeling their roots is not decreasing, at each step a partial tree is selected, 
among all possible candidates, in such a way that the border of its root is maximum. In 
other words, for i = 1, . . . ,k, the variable Ai labeling the root of the tree Ui : Ai ^ ViAiXi 
satisfies: 

border (Ai) = m.ax{border(A) \ there exists a partial tree 
U : A^ vAx of Ti with < \vx\ < 2'^}. 

We prove that with this choice border{Ai) < border{Ai^i), for i = 1, . . . , A; — 1. This is 
obvious if the partial tree U : Ai =^ ViAiXi of Tj is also a partial tree of Tj+i, namely, it 
belongs to the set of candidates when C/j+i is chosen to reduce Tj+i. If this is not the case, 
then Tj+i should contain a partial tree U' : Ai ^ v'Aix', with \v'x'\ > 2^, such that, after 
removing i7j+i, U' is reduced in Ti to U. It can be observed that in Tj+i such a reduction 
is possible only if the root Ai^i of U+i is a descendant of the root Ai of U' . Hence, in Tj+i 
the terminal string generated by the subtree whose root, labeled Aj+i, coincides with the 
root of C/i+i, must be a factor of the terminal string generated by the subtree whose root, 
labeled Ai, coincides with the root of U'. This implies that Ia^ < ^Ai+i < '"A^+i < ^a^- 
Hence, border{Ai) < border{Ai^i). □ 



Example: The language L = {a'^^'' a'^'^^ a^'^^ \ n,k,p > 0} can be generated by a 
grammar in Chomsky normal form with the following productions: 



s ^ 


AiE 


S'- 


AB 


A- 


> AiF 


B A2G 


Ai 01 


E 


SAs 


A - 


A1A2 


F - 


^AA2 


G BA3 


A2 02 


E 


S'A; 


B - 


^2^3 








As 03 


Note that S 


^ aiSas, 


A^. 


> aiAa2, 


and B - 


^ 02-803 • 


It is easy to 


get a tree Tq : S 



0^0203 and three partial trees U' : S ^ aiSa^, U" : A ^ 01^02, and U'" : B ^ 02-803. 
Given integers n, k,p > 0, a. derivation tree for the string o^^'^Og^^Og'*'" can be obtained 
considering Tq, and pumping it n — 1 times with the tree U' , k — 1 times with the tree 
U", and p - 1 with the tree U'" . Note that border{S) = (1,3) < border{A) = (1,2) < 
border {B) = (2,3). 

Lemma H] suggests a nondeterministic procedure which can be used to generate all the 
strings o^^ . . . o^ belonging to the language L{G): at the beginning a derivation tree 
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T : S ^ a"^ . . . of a "short" string is selected. Then the procedure enters a loop which 
is repeated a nondeterministically chosen number of times. At each iteration, the tree T 
so far considered is pumped with a nondeterministically chosen partial tree U : A ^ vAx 
(note that, by Lemma [H v G a^*^, x € a*^) such that A is a variable occurring in T. Note 
that, to implement this strategy, the procedure does not need to remember the whole tree 
T but only the set of variables occurring in it. 

The procedure is the following: 

nondeterministically select a tree T : S => a^^aj^ . . . , 

with ni + 712 + . . . + n™, < 2''"^ 
fci ^ Til, k2 ^ n2, . . ., km ^ rim 
enabled ^ v{T) 

iterate «— nondeterministically choose true or false 
while iterate do 

nondeterministically select a tree U : A ^ vAx, with < \vx\ < 2'', 
and A £enabled // border{A) = (/ajTa) 

ki^ ^ ki^ + \v\ 

enabled ^enabled U J^(I7) 

iterate ^ nondeterministically choose true or fa7se 
endwhile 
output aj^flj^ . . . 

Now, we will convert the above procedure into an (m — l)-turn PDA recognizing the 
language generated by the grammar G. For the sake of simplicity, let us start by describing 
the case m = 2 with v G and x G for each partial tree U : A ^ vAx. The PDA 
uses two bounded counters ni, n2 in order to remember the string a^^a^^ generated by the 
initial "small" tree. In a preliminary phase, the PDA consumes ni occurrences of oi from 
the input tape, in order to verify that a"^ is a prefix of the input (otherwise it stops and 
rejects). Subsequently, the automaton starts the simulation of the loop above described 
where, at each iteration, a partial tree U : A ^ vAx, with A G enabled is used to pump 
the generated string. To this aim, the automaton reads v & from the input tape and 
pushes a; G a2 on the stack (if v is not a prefix of the remaining part of the input tape, then 
the automaton stops and rejects). At the end of the loop, the pushdown store will contain 
a string 02, for some p > 0. Finally, the automaton accepts if and only if the remaining 
part of the input is 02^^^. We can observe that the automaton so described simulates the 
derivation of a string and verifies its matching with the input string. For the occurrences 
of the letter ai, the matching is verified immediately, by comparing the generated factors 
with the input string; for the occurrences of the letter 02, the verification of the matching 
is postponed: the generated factors are kept on the stack and compared with the input in 
the final phase. 

This strategy can be extended to the general case by pumping, according to Lemma [H 
with partial trees such that the sequence of the borders of their roots is not decreasing. 
More precisely, the PDA implements the following nondeterministic procedure, whose 
correctness is proved in Lemma [5] and Theorem [2l 
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nondeterministically select a tree T : 5 => a"^a2^ . . . , 

with m + n2 + . . . + nm. < 2''"^ 
read o"^ from the input 
enabled ^ v{T) 

{l,r) ^ (^,rn) // the "work context" 

iterate <— nondeterministically choose true or faJse 

while iterate do 

nondeterministically select a tree U : A ^ vAx, with < \vx\ < 2^, 

A £enabled, and {l,r) < border{A) = (IajTa) 
if r < ta then //new context to the right of the previous one 

for j <— Z + 1 to ) — 1 do 

consumelnputAndCounter(j) 

endfor 

for ji <— r to Ia do 

consumelnput AndCounter (j ) 

consumeInputAndStack(j) 
endfor 

else / /va < r: new context inside the previous one 

for j <— Z + 1 to Za do 

consumelnput AndCounter(j') 

endfor 
endif 

(Z,r) ^ (lA,rA) 
read v from the input 
it r then push x on the stack 
else read x from the input 

endif 

enabled ^enabled U i^{U) 

iterate <— nondeterministically choose true or faise 
endwhile 

for j ^ I + 1 to r — 1 do 

consumelnput AndCounter (j) 
endfor 

for j <— r to m do 

consumelnput AndCounter {j ) 
consumelnput AndStack(j) 
endfor 

if the end of the input has been reached then accept 

else reject 

endif 

In the previous procedure and in the following macros, the instruction "read x from the 
input tape," for x € S*, actually means that the automaton verifies whether or not x is 
a prefix of the next part of the input. If the outcome of this test is positive, then the 
input head is moved immediately to the right of x, namely, x is "consumed," otherwise 
the machine stops and rejects. 

The macros are defined as follows: 

Consumelnput AndCounter ( j ) : 
while Uj > do 

read Oj from the input tape 

nj -1 
endwhile 
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Consumelnput AndStack(j) : 

while the symbol at the top of the stack is aj do 

read aj from the input tape 

pop 
endwhile 



In order to prove that the pushdown automaton described in the previous procedure 
accepts the language L(G) generated by the given grammar G, it is useful to state the 
following lemma: 

Lemma 5 Consider one execution of the previous procedure. Let Tq : S ^ a"^ . . . oj^ be 
the tree selected at the beginning of such an execution. At every evaluation of the condition 
of the while loop, there exists a tree T : S ^ a^^ . . . a^, for some ki, . . . , km > 0, such 
that 

• the scanned input prefix is z = a^^ ■ ■ ■O'l' / 

• the pushdown store contains the string 7 = ar"" ...a^, where, for j = r,...,m, 

Pj > and kj = pj + nj ; 

• for I < j < r, kj = Uj; 

• enabled = v{T). 

Proof: It is easy to see that at the first evaluation of the condition it holds that 1 = 1, 
r = m, ki = ni, . . . ,km = rim, Pm = 0, the scanned input prefix is a^^ and the pushdown 
store is empty, namely, it contains a^. 

We now suppose the statement to be true before the execution of one iteration and we 
show that it still holds true at the end of the iteration. Let U : A ^ vAx be the partial 
tree selected in the while loop. Because A € I'iT), the derivation tree T : S ^ a^^ . . . 

* k' k' 

can be pumped with the partial tree C/, obtaining a new tree T : S ^ a-^^ . . . Om , with 
z^(T') = i^(r) U v{U), where (in the case Ia 7^ ta) k[^ = ki^ + \v\, k'^^ = kr^ + |x| 
and kj = kj for each j 7^ Ia and j 7^ (in the special case Ia = ^a we have that 
= ^'ta = ^'a + \vx\). 

We now consider two subcases, corresponding to the selection in the while loop. 
Case r < rA- 

By Lemma [21 this implies that l < I a and r <Ia. 

First, we prove that for each j, with Ia < j < fA, the stack cannot contain the symbol aj, 
i.e., Pj = 0. Suppose, by contradiction, that the string 7 contains at least one occurrence 
of aj. This symbol must have been pushed on the stack in a previous iteration, with 
"work context" (/,r), for some l < f = j. Since the procedure never removes variables 
from the tree, the variable used to pump the tree in such a previous iteration is also 
in the tree T' . Moreover, the procedure chooses contexts in a nondecreasing order, so 
{l,r) < {Ia,'''a)- By Lemma [2] it turns out that either / < /a < '^A < ^ = J, or / < Ia, 
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f < rA, and j = r < Ia- It is easy to observe that in both the cases we get a contradiction. 
Hence, for any j with Ia < j < '''A, the stack does not contain the symbol aj, i.e., 
P/4+1 = P«A+2 = • • • = PvA-i = 0- This imphes that 7 = j''^ with 7" = Or*^ . . . a^'j^ and 

^/ _ P^A Pm 

J — "^rA • • • "-rn • 

Now, we observe the operations on the input and on the stack that are performed during 
the execution of the body of the foop: 

• the input factor aj^"^^ . . . a^l^^Or''^"'' . . . af^^"^ = t^f+Y • • • ^i'^ consumed; 

• the string 7" is popped off the stack; 

• the input factor v £ a^^ is consumed; 

• if Ia < rA then the string x € a*^ is saved on the stack (to be consumed later), 
otherwise it is consumed immediately. 

By summarizing, in the case Ia < fA, at the end of the iteration the scanned input prefix is 

a^^ . . . a^^ V = . . . a^''^ , the pushdown store contains the string xc^^^^ . . . , for each 
j , Ia < j < rA, kj = kj = rij, and enabled = i^{T'). With small changes we can deal with 
the case Ia = ^a- 

Case VA < r. 

By Lemma [21 this implies that I < Ia 1^ i"a ^ r. 

k 

If I A < TA then the consumed input prefix is a^ . . . cli^v and the pushdown store contains 

the string X7 = xor^'^ . . . Om", where Pr-A = Pr^+i = • • • = Pr-i = 0. At this point it is not 
difficult to verify that the statement of the Lemma is true. The subcase Ia = ta can be 
managed with easy changes. □ 

As a consequence: 

Theorem 2 The pushdown automaton M described by the previous procedure is an (m — 
l)~turn PDA accepting the language L{G). 

Proof: First, we show that the number of turns of the PDA M defined in the above 
procedure is at most m — 1. To this aim we count how many times the automaton can 
switch from push operations to pop operations. 

At each iteration of the while loop, the automaton can perform push operations. Pop 
operations are possible only by calling the macro consumelnputAndStack. This happens 
first in the while loop, when the condition r < holds true, i.e., when the new context 
{lAifA) is to the right of the previous context (/, r), and secondly after the end of the loop. 

Let (/i,ri), {l2-,i'2)., . . . {Ikifk) be the sequence of the contexts which in the computation 
make the above-mentioned condition hold true. Hence, 1 < Zi < . . . < < m, that 
implies k < m — 1. If k < m — 1, then the PDA M makes at most k < m — 2 turns in 
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the simulation of the while loop and one more turn after the loop. So the total number of 
turns is bounded by m — 1. 

Now, suppose that k = m — 1. This implies that 1^ = rn = r^. Before reaching the 
context {Ikifk), at most m — 2 turns can be performed. When the automaton switches 
to the new context {lk,rk) = {m,m), it can make pop operations, by calling the macro 
consumelnputAndStack(m). This requires one more turn. After that, the automaton 
can execute further iterations, using the same context (m, m). By reading the procedure 
carefully, we can observe that it never executes further push operations. Finally, at the 
exit of the loop, further pop operations can be executed (consumelnputAndStack). Hence, 
the total number of turns is bounded by m — 1. 

To prove that the language L{G) and the language accepted by the automaton defined 
in the above procedure coincide it is enough to observe that given a string z G L{G), 
the procedure is able to guess the tree Tq and the partial trees Ui, . . . ,Uk of Lemma [H 
recognizing in this way z. Conversely, using Lemma O it is easy to show that each string 
accepted by the procedure should belong to L{G). □ 

Corollary: Given an alphabet S = {ai, . . . ,am}, for any context-free grammar G 
in Chomsky normal form with h variables generating a letter-bounded language L C 
. . . aj!^, there exists an equivalent (m — l)-turn PDA AI with 2^^^^ states and 0(1) stack 
symbols. 

Proof: The most expensive information that the automaton defined in the previous 
procedure has to remember in its state are the m — 1 counters bounded by 2^~^, and the 
set enabled, which is a subset of V. For the pushdown store an alphabet with m + 1 
symbols can be used. With a small modification, the pushdown store can be implemented 
using only two symbols (one symbol to keep a counter pj and another one to separate two 
consecutive counters), and increasing the number of states by a factor m, to remember 
what input symbol aj the stack symbol A is representing. □ 

Using standard techniques, a PDA of size n can be converted to an equivalent CFG in 
Chomsky normal form with 0{'n?) variables. Hence, we easily get: 

Corollary: Each PDA of size n accepting a subset of . . . a*^ can be simulated by an 
equivalent (m — l)-turn PDA of size 

We now consider the situation when the given CFG is not necessarily in Chomsky normal 
form. 

Lemma 6 Given a context-free grammar G = {V, TijP, S), there exists an equivalent con- 
text free grammar G' = {V ,Ti, P' , S) such that the length of the right hand side of any 
production belonging to P' is at most 2, Var(G') < Symb(G), and the rank of G' coincides 
with the rank of G. 

Proof: Without loss of generality, we suppose that for each variable A in the set V there 
is a production with A on the left hand side. 
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The set V' of variables of G' is defined by considering all variables in the set V, plus 
some extra variables as defined below. The set of productions P' is defined as follows: We 
consider each production A X1X2 ■ ■ ■ Xm belonging to P, with Xi £ VUT,, i = 1, . . . ,m: 

• If m < 2, then the production A X1X2 ■ ■ ■ Xm belongs to P'. 

• If m > 2, then m — 2 new extra variables -Di, • • • > Dm-2 are introduced in the 
set V\ and the following productions are added to P': 

A XiDi,Di X2D2, . . . , Dm-3 Xm-2Dm-2, Dm-2 —> Xm-lXm- 

• No other productions are in P'. 

Note that the construction of P' is similar to the last step in the classical reduction of 
general context-free grammars to Chomsky normal form. 

It is easy to verify that G and G' generate the same language. Furthermore, by construc- 
tion, the right hand side of each production of G' has length at most 2. 

We recall that each variable of V appears on the left hand side of some production. 
Furthermore, for each production A ^ a, with |a| > 2, \a\ — 2 extra variables have been 
introduced in V' . Hence: 

Symb(G) = Yl (2 + |al) 

{A^a)eP s.t. |q|>2 

> #v+ Y (!"l-2) 

{A^a)eP s.t. |q|>2 

= Var(G'). 

Finally, it is immediate to observe that the definition of G' preserves the rank. □ 

Corollary: Given an alphabet S = {ai, . . . ,am}, for any context-free grammar G with 
Symb(G) = h and generating a letter-bounded language L C a* . . . a^, there exists an 
equivalent {m — l)-turn PDA M with 2*^^^) states and 0(1) stack symbols. 

Proof: At first, it can be observed that Lemma U] is true not only for CFGs in Chomsky 
normal form but also for CFGs whose productions have right hand sides of length at most 
2. Thus, all arguments in Section 3 are also true for such "normalized" CFGs. Owing to 
Lemma m we then observe that any CFG G can be converted to an equivalent CFG G' 
such that the length of the right hand side of any production belonging to G' is at most 2 
and Var(G') < Symb(G). With similar arguments as in Corollary [3] we obtain the claim. 
□ 

We will discuss and present the extension of Corollary [3] to the word-bounded case in 
Section [H 
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4 Reducing the Number of Turns 



By the results presented in Section [3l each context-free subset of . . . can be accepted 
by an (m — l)-turn PDA. In particular, Corollary [3] shows that the size of an (m — 1)- 
turn PDA equivalent to a given PDA of size n accepting a subset of . . . , is at most 
exponential in the square of n. 

In this section, we further deepen this kind of investigation by studying how to convert 
an arbitrary /c-turn PDA accepting a letter-bounded language L C a^ag . . . aj^ to an 
equivalent (m — l)-turn PDA. It turns out that the increase in size is at most polynomial. 
All PDAs we consider are in normal form. 

Let us start by considering the unary case, i.e., m = 1, which turns out to be crucial 
to get the simulation in the general case. In the following we consider a /c-turn PDA 
M = (Q,'E,T,6,qo, Z(),F) in normal form. Then we know that at most one symbol is 
pushed on the stack in every transition. 



Lemma 7 Let M be a PDA accepting a unary language L. Let L{qi, A,q2) be the set of 
all words which are processed by 1-turn sequences it of configurations starting with some 
stack height h in a state qi and having A as topmost stack symbol and ending with the same 
stack height h in some state q2 and having A as topmost stack symbol. Then, L{qi,A,q2) 
can be recognized by an NFA M' such that size{M') < v? and n = size{M). 



Proof: Consider the following CFG G with start symbol [gi,^, 52] having the following 
productions. Let p,p' , q,q' G Q, Z € T, and a € {a, e}. 



(1) [p, Z, q] ^ a[p', Z, q], if 6{p, Z, a) 3 {p\ -), 

(2) [p, Z, q\ ^ a\p, Z, q'], if 5{q' , Z, a) 9 (g, -), 

(3) [p,Z,q] ^ [p',Z',q'], if Z, e) 9 {p' ,push{Z')) and 5{q\Z',e) 3 {q,pop), 

(4) [p, Z, q]^ e,ifp = q. 



We want to describe how M' simulates a 1-turn sequence vr. We simulate the parts of vr 
with A as topmost stack symbol and stack height h with productions (1) and (2). The first 
part from the beginning up to the first push operation is simulated using productions (1). 
The second part starting at the end of the computation and going backwards up to the 
last pop operation is simulated with productions (2). We may change nondeterministically 
between productions (1) and (2). This is possible, since the input is unary. Having 
simulated the parts of vr with stack height h it is decided nondeterministically to proceed 
with simulating the parts of vr with stack height h + \. Productions (3) simulate a push 
operation and the corresponding pop operation. Then, productions (1) and (2) can be 
again used to simulate the parts of vr with stack height h + \. Now, we iterate this 
behavior and simulate all computational steps in vr while the stack height simulated is 
growing. Finally, we can terminate the derivation with productions (4) when the stack 
height has reached its highest level and all computational steps have been simulated. 
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Now, we construct an equivalent NFA M' = (Q', S, 5', (gi, ^, 92), -^') as follows: Q' = 
Q X T X Q and F' = {{q, Z,q) \ q & Q, Z ^ T}. For a E {a, e} the transition function 6' is 
defined as: 

(1) 6'{{p, Z, q),a) 3 {p\ Z, q), if 5{p, Z, a) 9 (p', -), 

(2) 6'{{p, Z, q),a) 3 {p, Z, q'), if 6{q', Z, a) 3 {q, -), 

(3) 6'{{p,Z,q),a) 3 {p',Z',q'), if 6{p,Z,e) 3 {p' ,push{Z')) and 6{q',Z',e) 3 {q,pop), 

It is not difficult to observe that T(M') = L{G). □ 

Corollary: Let M be some 1-turn PDA accepting a unary language L. Then, an equiv- 
alent NFA M' can be constructed such that size{M') < + 1 and n = size{M). 

Proof: We can use the above construction, but additionally have to guess in a first step 
in which state a computation ends. Therefore, we add a new start symbol S and add 
productions S — > [qo, Zq, qj] for all qj G F. For the NFA construction we add a new initial 
state qQ and the following rules 5'{qQ, e) 3 {qo, Zq, qj), for all g/ G F to M' . 

It is easy to observe that the parts of vr with stack height one can be again simulated with 
productions (1) and (2). The remaining part of the simulation is identical to the above 
described construction. □ 

A subcomputation vr' is called strong of level A if it starts with some stack height h and 
topmost stack symbol A, ends with the same stack height h and topmost stack symbol A, 
and in all other configurations of tt' the stack height is greater than h. 

Lemma 8 Let M be some k-turn PDA accepting a unary language L. Let L{qi,A,q2) be 
the set of all words which are processed by sequences vr of strong computations of level A 
which, additionally, start in some state qi and end in some state q2- It can be observed 
that all words in L(qi,A,q2) are accepted with j < k turns. Then, L(qi, A,q2) can be 
accepted by an NFA M' such that size{M') G 0(n2Li°S2iJ+2) and n = size{M). 

Proof: The construction is very similar to the above described construction. Addition- 
ally, we store the number of turns, which have to be simulated, in the fourth component 
of the variables. There are two cases how vr may look like. In the first case (type I, cf. 
Fig. [H left) TT consists of at least two strong computations of level A. We introduce a new 
production type (5) which is used to decompose a sequence of strong computations with 
i turns into two subsequences with ii and i2 turns, respectively. A resulting subsequence 
is then either again of type I and can be again decomposed with the new productions (5), 
or it is of type II, i.e., it consists of one strong computation of level A (cf. Fig. [H right). 
If this computation is 1-turn, it can be simulated with the productions (1) to (3) and 
finished with productions (4). If it is not 1-turn, we can reduce it to a sequence of strong 
computations of level B by using the productions (1) to (3). Then, the same analysis can 
be made for strong computations of level B. 

The formal construction of the CFG G is as follows. We consider the start symbol 
[qi, A, q2, j] and the following productions. Let p,p',q,q' G Q, Z € T, and a G {a, e}. 
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(1) \p,Z,q,i] a[p',Z,q,i], if6{p,Z,a) 3 {p',~), 

(2) \p,Z,q,i] a\p,Z,q',i], if 6{q',Z,a) 3 (g, -), 

(3) \p,Z,q,i] ^ \p',Z',q',i], if <5(p,Z,e) 3 (p' ,push{Z')) and S{q',Z',€) 3 {q,pop), 

(4) [p,Z,q,l] ^e,iip = q, 

(5) [p, Z, q, i] [p, Z, r, ii] [r, Z, q, 12], for all r G Q and ii, ^2 > 1 such that ii + 12 < i- 

It can be shown by an induction on the number of turns that G generates L{qi,A, ^2)- 

Wc can observe that all productions arc right-linear except for productions (5). Since 
the last component of a variable [p, Z, r, i] is reduced in every application of a production 
(5), we can conclude that (5) is applied at most j — I times. Thus, every sentential form 
contains at most j variables. Thus, we can construct some NFA simulating the single 
derivation steps by representing all variables of a sentential form in its state. A rough 
estimation of the number of states is then 

We now want to do some finer estimation and will obtain 2(j|(5t^|r|)L^°S2iJ+i as upper 
bound. To this end, we observe that a simulation of a production (5) increases the number 
of variables in the current state of the NFA by one and that a simulation of a production 
(4) at the end of some 1-turn computation decreases the number of variables by one. Thus, 
our strategy is to apply productions of type (4) as soon as possible. Now, whenever an 
application of a production (5) has replaced a variable [p, Z, q, i] by two variables [p, Z, r, ii] 
and [r, Z,q,i2]: then the derivation of the variable with the lower number of remaining 
turns is simulated. This makes sure that the total number of variables in a state is as small 
as possible. The worst case which can occur in this context is that in every application of 
a production (5) the number of turns is divided into two equal parts. This may happen 
at most [log2 j\ + 1 many times. Thus, the size of the NFA can be estimated as follows 



A. ^-^i^i 1^1^ - j\Qm-i jiQnri-i 



1=1 

|2 



= _L§i^^(_^-|Q|2|r|)Liog2jJ+i < 2(i|QP|r|)L'°g2JJ+i G 0(n2Li°g2^>2) 
j\Q\^\T\ - 1 

Finally, it can be observed that the last component in a tuple [p, Z, q, i] may be removed, 
since the maximum number of possible turns only depends on p, Z, q. This may save the 
constant factor j in the above estimation. □ 

Corollary: Let M be some fc-turn PDA accepting a unary language L. Then, an 
equivalent NFA M' can be constructed with size{M') G 0(n^l-iog2 ^^^^ ^ ^ size{M). 
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Figure 1: The two cases arising in the construction in Lemma [8j 

Proof: Observe that an accepting computation in M is a sequence of strong computations 
of level Zq starting in qq and ending in some accepting state. □ 

Now, we are able to consider the general case, i.e., m > 1 and start with some definitions. 
Given the alphabet S = {oi, . . . , am}, we define the set n(m) as follows 
n(m) = {aj I 1 < i < m} U {a^aj | 1 < i < j < m} 

It is easy to show that the cardinality of n(m) is Let w G a*a2 . . .a^ be some 

string. Then tti{w) denotes the projection to the first symbol of w and TTr{w) denotes the 
projection to the last symbol of w. For example, let w = 020304. Then, tti{'w) = 02 and 

7Tj.{w) = O4. 

Theorem 3 Let M be some k-turn PDA accepting a letter-bounded language L C 0^02 . . . o* 
Then, an equivalent (m — l)-turn PDA M' can he constructed such that size{M') G 
©(m^n^LiogafcJ+S) and n = size{M). 

Proof: It has been shown in the previous section that any L C 0^02 . . . can be 
accepted by an (m — l)-turn PDA. If L is accepted by a A:-turn PDA such that k > m — 1, 
then some turns are in a way "not necessary." We will show in this proof that this finite 
number of additional turns takes place within unary parts of the input, i.e., while reading 
some input a* with 1 < i < m. Then, with the help of the construction of LemmaO these 
parts can be accepted by NFAs and hence do not affect the stack height in the construction 
of an (m — l)-turn PDA accepting L. 

The construction is similar to the constructions of the previous two lemmas. Additionally, 
we introduce a fifth component of the variables in which some element s G n(m) is stored. 

s = ai {1 < i < m), this means that the variable can only produce terminals Oj. If 
s = OiUj (1 < i < J < m), then such a variable can only produce sentential forms which 
start with terminals Oj and end with terminals Oj. 

For the construction we first consider a context-free grammar G with start symbol S and 
having the following productions. Let p,p' , q,q' G Q, Z £ T, a,b G {oi, . . . , am} such that 
ab G n(m), a G {a, e}, and b G {b, e}. 

(1) [p,Z,q,i,ab] ^ a[p' , Z,q,i,ab], if6{p,Z,a) 3 {p',-), 

(2) [p,Z,q,i,ab] [p, Z,q' ,i,ab]b, if 6{q' , Z,b) 3 (g, -), 

(3) [p,Z,q,i,ab] [p' , Z' ,q' ,i,ab], if 6{p, Z, e) 3 {p' ,push{Z')) as well as 5{q',Z',e) 3 
{q,pop), 
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(4) Ip, Z, q, 1, u] ^ e, if p = q, for all u G n(m), 

(5) [p, Z, g, i, a6] — > [p, Z, r, ii, u] [r, Z, g, ^2, i'], for all r G Q, ii, ^2 > 1 such that ii+i2 < i, 
\u\ > 1, > 1, iTi{u) = a, and iTriv) = 6, 

(6) [p, Z, q, i, ab] ([p, Z, r, ii,a], [r, Z, q, Z2, v]), for all r £ Q, ii,i2 > 1 such that ii + 
«2 < i, T^r{v) = b, 

(7) Ip, Z, q, i, ab] — > ([p, Z, r, ii,u], [r, Z, q,i2,b]), for all r £ Q, ii,i2 > 1 such that «i+i2 < 
i, 7ri(n) = a. 

(8) S [qo, Zq, qf,i, u] for all g/- G F, 1 < i < k, and u G n(m), 

The productions (1) to (4), and (8) are defined similarly to the previous constructions. In 
the productions (5) to (7), a computation with i turns is decomposed into two subcompu- 
tations with ii and 12 turns, respectively. Additionally, we differentiate whether we obtain 
subcomputations producing only one type of terminals or not. The former case is handled 
with productions (6) and (7), for the latter case we have the productions (5). 

Moreover, we have to define productions for variables of the form [p, Z, q, i, a] with a G 
{oi, . . . , am}- Such variables are from now on called "unary" variables. Owing to Lemma[8] 
we know that the language L{p, Z, q) can be accepted by some NFA A having at most 
2(i|Q|2|r|)Li°S2iJ+i states. This NFA can be converted to some right-linear grammar Ga 
with at most 2(z|Qp|r|)L'°S2«J+i variables. Now, the productions for a unary variable 
[p, Z, q, i, a] with a G {ai, . . . , am} are defined to be the productions of the corresponding 
right-linear grammar Ga- 

In order to finally get a PDA making at most m — 1 turns, we have introduced in the 
productions (6) and (7) some special variables of the form {\p, Z,q,i, a],[p' , Z' ,q' ,i' ,v]) 
which, at a first glance, are not natural and intuitive. 

To derive such a variable {[p,Z,q,i,a], \p' , Z' ,q' ,i' ,v]), we first derive its first component 
[p, Z, q, i, a] with the above defined unary productions. Observe that the resulting produc- 
tions are right-linear. For variables having the form (e, [p' , Z' , q' , i' , v]) we add productions 
{e,\p' , Z' ,q' ,i' ,v]) — > [p' , Z' ,q' ,i' ,v]. The remaining second component \p' , Z' ,q' ,i' ,v] is 
then derived with the productions (1) to (7) if > 1 and with the above defined produc- 
tions otherwise. 

To derive a variable {[p, Z, q,i,u\,\p' , Z' ,q' ,i' ,b]), we first derive its second component 
\p' , Z' ,q' ,i' ,b] with the above defined unary productions. Observe that the resulting pro- 
ductions are left-linear. We add productions of the form {\p,Z,q,i,u],€) — > [p, Z,q,i,u\ 
and the remaining first component [p, Z, q, i, u] is then derived with the productions (1) to 
(7) if |u| > 1 and with the above defined productions otherwise. We would like to remark 
that variables of the form ([p, Z, q, i, a], [p' , Z', q' , i' , b]) are treated as in the first case, i.e., 
we start to derive the first component with unary productions and then derive the second 
component. 

It can be observed that G generates T{M). Since all productions in G are linear except 
those of type (5), the number of variables occurring in a sentential form can only be 
increased by applications of productions of type (5). It can be observed that the maximum 
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number of variables introduced by productions of type (5) is bounded by the maximum 
number of decompositions of the string aia2 ■ ■ - CLm into substrings wi,W2-, ■ ■ ■ ,wi such that, 
for 1 < t < I, wt = aiflj with 1 < i < j < m and 1011112 ■ ■ - Wi G 0.10,2 ■ ■ ■ a^. It is easy to 
show that I < m — 1. Thus, every sentential form contains at most m — 1 variables which 
implies that G is ultralinear of rank m — 1. 

We next convert G to some equivalent one-state PDA M' using the standard conversion 
algorithm as given for example in [8]. Since the rank of G is m — 1, it can be observed 
that the maximum number of variables on the stack is bounded by m — 1. Furthermore, 
any decreasing of the stack starts by deleting some variable from the stack. This action 
corresponds to an application of some production of type (4) in the grammar. Thus, the 
number of turns in M' is bounded by the number of possible applications of productions 
of type (4) which is in turn bounded by the number of variables on the stack. Thus, the 
number of turns in M' is bounded by m — 1. 

We now want to estimate the size of M' which is bounded by the number of variables of 
G and the size of the alphabet E. The number of variables of type [p, Z, q, i, u] is bounded 
by 0(A;m^|Qp|r|) and the number of variables resulting from the unary productions is 
bounded by m(2A:|Qp|r|)Li°S2^'J+i. We now want to estimate the number of variables 
resulting from variables of type {\p, Z,r,i,u\, [r, Z, q, j, v] ) . Observe that in each such vari- 
able |n| = 1 or \v\ = 1, respectively. Since these unary parts are derived first, the number 
of variables resulting is bounded by the product of the number of variables resulting from 
unary productions and of the number of variables of type [p, Z,q,i,u]. Thus, the num- 
ber of variables resulting from variables of type ([p, Z,r,i,u], [r, Z,q,j,v]) is bounded by 
0(m3(2A:|Q|2|r|)Li°g2fcJ+2) which implies \M'\ G 0(m3(2A:|Q|2|r|)Li°g2'=J+2). 

Finally, we convert M' to a PDA in normal form. This may cause at most an additional 
quadratic blow-up. Thus, we obtain 0(m*^(2/c[Q|2|r|)2Liog2fcJ+4) = o(^6^4Liog2 fcJ+8) 

upper bound. □ 

Corollary: The trade-offs between finite-turn pushdown automata that accept letter- 
bounded languages are at most polynomial. 



5 Lower Bounds 

In this section we show the optimality of the simulation of grammars generating letter- 
bounded languages by finite-turn PDAs (Corollary [3]), and of some other simulation results 
presented in the paper. Even in this case, the preliminary investigation of the unary case 
will be useful to afford the general case. 

Theorem 4 For any integer n > 1, consider the language Ln = {a^"}- 

(1) Ln can he generated by some CFG in Chomsky normal form with n + 1 variables. 

(2) Every NFA accepting Ln needs at least 2" states. 
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(3) For each /c > 0, every k-turn PDA accepting Ln is at least of size 2'^" for some 
constant c > and any sufficiently large n. 

Proof: To prove (1) it is enough to observe that L„ can be generated by the grammar 
G with the following productions: 

S AiAi 
Ai A2A2 

An a 

The proof of (2) is trivial. 

Finally, to prove (3) consider a A;-turn PDA M of size s{n) accepting L„. Due to CorollaryH] 
we can construct an equivalent NFA of size s'{n) < Hs^{n) for suitable constants H,K. 
Since s'(n) > 2"", we obtain s(n) > for some other constant H' . □ 

From Theorem's), it turns out that for each integer m the simulation result stated in 
Corollary [3] is optimal. The witness languages are unary. Hence, they can be also accepted 
by "simpler" devices, i.e., finite automata or PDAs with less than m — 1 turns. We now 
show the optimality in a stronger form, by exhibiting, for each integer m, a family of 
witness languages that cannot be accepted with less than m — 1 turns. 

Theorem 5 Given the alphabet E = {ai, . . . ,am}, for any integer n > 1 consider the 
language 

J^n — 0,2 • • • I ™0 — ^ ^ . . . , 71^-1 ^ i|. 

(1) Ln is generated by some CFG in Ghomsky normal form with n + 4m — 3 variables. 

(2) Ln is accepted by an (m — l)-turn PDA of size 

(3) For each integer k > m — 1, every k-turn PDA accepting Ln is at least of size 2^^ 
for some constant c > and any sufficiently large n. 

(4) Ln cannot be accepted by any PDA which makes less than m — 1 turns. 
Proof: Consider the grammar G with the following productions: 

• Bi ^ C1B2, . . . , Bm-3 Cm-3Bm-2, Bm-2 —>■ Cm-2Cm-l 

• Ao^ AiAi,Ai A2A2, An-l AnAn, An ^ Oi 

• Ci ^ DiEi I DiDi+i,Ei CiDi+i, for i = 1, . . . , m - 1 



22 



• Di ^ Ui, foT i = 1, . . . , m. 

It is possible to verify that this grammar G generates the language L„. In particular, 
observe that from each Cj we can derive terminal strings only of the form a*a-_,_]^, with 
t > 1, and from we can derive only the string a\ . By observing that we can use the 
same variable for An and Di, we easily conclude that the total number of variables is 
n + 4m — 3. This proves (1). Furthermore, as an easy consequence, applying Corollary [3] 
we get (2). 

Now, given k > m — 1, suppose to have a A;-turn PDA of size s accepting L„. By replacing 
each move consuming a symbol Oj, where i > 1, with an e-move, we get another fe-turn PDA 
with s states accepting the language L'^ = {a"^ | ni > 2"}. Using a slight modification of 
Theorem m we can get that s > 2*^" for some constant number c > and any sufficiently 
large n. This proves (3). 

We finally prove (4). To this aim, we first prove that each context-free grammar which 
generates the following language L must have rank at least m — 1: 

L = {a^ a^^+"^ . . . a:;"!.f +"-^a;^-i | m > 1, . . . , n„,_i > 1}. 

Let G be a grammar with h variables which generates L. We can suppose that the right 
hand side of each production of G has length at most 2 (by Lemma [6] this restriction 
preserves the rank, furthermore it can be easily seen that Lemma [U used in the following, 
holds even for there grammars). Let H = 2^ and z = a^a^ ■ ■ ■ a^_ia^ € L. 

Given a derivation tree T : S ^ z, consider an integer A; > 0, derivation trees Tq, . . . ,Tk 
of strings zq, . . . , z^, partial derivation trees Ui : Ai ^ ViAiXi, i = 1, . . . ,k, according to 
Lemma m For i = let border{Ai) = {li,ri). Note that ri = li + 1, otherwise, 

by pumping Tj_i (which generates the string Zi-i E L) with the partial tree C/j, the 
resulting string Zi should not belong to L. Considering the definition of the relation < 
between borders and Lemma [2l this easily implies that for each pair of variables Ai , Aj G 
{^1, . . .Ak}, either Ai and Aj have the same border or they lie on two different paths 
from the root of the tree T. We now prove that for each j = l,...,m — 1, there is a 
variable Ai- G {Ai, . . . , A^,} such that border{Ai.) = + 1), obtaining in this way m—1 
variables belonging to different paths from the root of T. 

Suppose, by contradiction, that there is an index j such that (J, j + 1) ^ {border{Ai) \ i = 
1, . . . , k}. Hence, there is an index r, 1 < r < k, such that 0</i<...<^r^J— 1 and 
Ir+i > J (r = 1 in the case J = 1). The pumping process described in Lemma [H starts from 
the tree Tq, which generates a string zq = a^ia2^+"2 _ _ _ a'^Ji'^''"'-' a^"'. The number 
of occurrences of the letters ai,...,aj can be incremented only by pumping with the 
partial trees Ui, . . . ,Ur, while the number of occurrences of the letters aj+i, . . . ,am only 
by pumping with the partial trees Ur+i, . . . ,Uk- Hence, the terminal string generated at 
the rth step should be 

Zr — a2 ... "j+i • • • "m-1 "m 

For i = 1, . . . , J, let Oj be the number of occurrences of letters Oj and Oj+i added during the 
pumping process, which leads from zq to Zr, by those partial trees among Ui, . . . ,Ur such 
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that the borders of their roots coincide with (i, i + It is easy to verify that ai = H — Ui, 
for i = 1, . . . , J. Furthermore, by our choice of J, it turns out that aj = 0. This impUes that 
rij = H. This is a contradiction, because raj < |zo| < H. Hence, we finally get that the 
tree T contains m — l variables , • • • , ^i^-i G {^i , ■ ■ ■ ,^k} which lie on different paths 
from the root, i.e., there is a derivation of the form S ^ aiAi-^a2 ■ ■ ■ am-iAi^_-^^am ^ z. 
Thus, we conclude that the rank of the grammar G is at least m — l. 

Using a slight modification of the construction given in the proof of Theorem [3l we can 
show that from a fc-turn PDA it is possible to get an equivalent grammar of rank k. This 
implies that if a A;-turn PDA accepts L then k > m — 1. 

To complete the proof, we observe that given a PDA M accepting the language L, we 
can build a PDA M which accepts L by working in two phases: In the first phase M 
simulates the moves of M from the initial configuration, as long as M consumes the input 
af". In this phase, each move consuming the symbol oi is replaced by an e-move (an 
internal variable counts, up to 2", the number of these moves). In this way, M is able 
to reach, without consuming any input symbol, every configuration reachable by M by 
consuming the input prefix af" . At this point, the second phase can start. In this phase 
M makes exactly the same moves as M. It is easy to see that M accepts the language L. 
Furthermore, if the given PDA M is fc-turn, M is fc-turn, too. 

In conclusion, having proved that L cannot be accepted by A;-turn PDAs with k < m — 1, 
we can conclude that L cannot be accepted by fc-turn PDAs with k < m — 1, too. □ 

Remark that we have considered so far only CFGs in Chomsky normal form and the mea- 
sure Var. It is easy to observe that we also obtain exponential trade-offs when considering 
the measure Symb. This shows that the result of Corollary [3] is also optimal. Since L^i 
can be accepted by a PDA of size 0{n), we obtain that the result of Corollary [3] is nearly 
optimal. 

We complete this section by considering again the unary case. In particular, we prove that 
the upper bound stated in Corollary [4] is tight. 

Theorem 6 Consider the language family 

L'^ = {a^ \t>OAt = mod nAt = mod n + 1} 

for natural numbers n > 2. Then each L'^ can be accepted by some 1-turn PDA of size 
2n + 1, but every NFA accepting L'^ needs at least v?' + n states. 

Proof: A 1-turn PDA accepting L'^ starts with checking whether the length of the input 
is divisable by n in its states. At the same time, the input is stored in the stack. Then 
the PDA guesses that the whole input is read and checks whether the length of the input, 
which is stored on the stack, is divisable by n + 1. Finally, the PDA accepts if the whole 
input is read, divisable by n and n + 1, and the stack is empty. Otherwise, the input is 
rejected. It can be observed that such a PDA is 1-turn, has one stack symbol (apart from 
Zq) and has 2n -|- 1 states. Since gcd(n,n + 1) = 1, we can apply a result from [7j and 
obtain the latter claim. □ 
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6 Word-Bounded Languages 



In this section we study how to extend our results from the letter-bounded case to the word- 
bounded case. The idea is that of reducing the latter case to the former one. To this aim, 
a large part of the section is devoted to prove that for fixed m words ■wi,W2,---, Wm G S*, 
m symbols ai, . . . , and the homomorphism cp associating with each symbol the string 
Wi, i = 1, . . . ,m, and for each context-free grammar G = {V, S, P, S) in Chomsky normal 
form generating a subset of wlw2---w^, we can get another context-free grammar G 
whose number of symbols is linear in the number of symbols of G, namely Symb(G) = 
0(Symb(G)), and such that L{G) = (j)^^{L{G)), i.e., for all integers ki,...,km > 0: 
a^i . . . a^- G L{G) if and only if w^K . . w'^ G L{G). 

The construction is given in two steps: first we introduce a new grammar G' = {V, E, P', S') 
equivalent to G. In such a grammar, the variables of G are marked with some indices which 
are useful to recognize where, in a derivation, a variable can produce the first symbol of 
one of the Wi's. This will be useful to get from G', in a second step, the required grammar 
G. 

We start by considering the following set of variables: 

V' = {[A,i,l,r,j] \ AeV,l <l <r <m,0 <i < \wi\,0 < j < \wr\}. 

The definition is given in such a way that a variable [A,i,l,r, j] can generate all terminal 
strings of the form a(3^ generated by A, such that P & w'f . . . w*, a is the suffix of wi 
which starts in position i + 1 and 7 is the prefix of Wr which ends in position j. If Z = r, 
furthermore, the variable [A, i, I, l,j] will be able to generate the factor of wi from position 
i + 1 to position j if A is able to do this. 

To this aim, we define the following productions: 

(1) [A,j-l,l,l,j]^a, 

for all A € V such that ^ ^ a is a production in P, 1 < I < m, 1 < j < \wi\, and 
a = wij, i.e., a is the symbol in position j of wi. 

(2) [A,i,l,r,j] ^ [B,i,l,h,k][C,k,h,r,j], 

for all A, B,C £ V such that A BC is a production m. P , 1 < I < h < r < 
< i < \wi\, d < k < \wh\, and < j < \wr\- 

(3) [A,i,l,r,^] ^ [A,i,l,r,\wr\l 

for all 1 < Z < r < m, < i < ju)/!. 

(4) [A,\wi\,Ur,j] ^ [^,0,Z,r,i], 

for all 1 < Z < r < m, < J < \wr\- 

(5) [^,i,/,r,0] -> [^,i,/,/i,0], 

for all 1 < / < /i < r < m, < i < I?/;;!. 

(6) [A,\wi\,l,r,j] -> [A,i,\wh\,h,r,i], 

for all 1 < I < /i < r < m, < J < \wr\- 
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The initial symbol S' of G' is [S, \ wi\, l,m,0]. 

The following lemma states the main property of the variables of the grammar G' and it 
is crucial in order to prove that G' is equivalent to G: 

Lemma 9 For each variable [A,i,l,r,j] of the grammar G' above defined and for each 
string w eT,*, it holds that [A,i,l,r,j] w if and only if A 4>g w, and 

• either w = . . . Wi^\yj^\l3wr,i ■ ■ ■ Wrj, with /? G iW;* . . . w*, 

• or I = r, i < j, and w = wi^i+i . . . wij. 

Proof: We first prove the only if part. Let s > 1 be an integer such that [A, i, I, r, j] 
w. The proof is by induction on s. 

If s = 1 then the derivation consists only of the production [A,j — 1,1,1, j] wij. This 
implies that i = j — 1, I = r, and A 4>g u^ij- 

If s > 1 then we have to consider several possibilities depending on which kind of pro- 
duction is used in the first step of the derivation. (The following list refers to the above 
defined productions (2), . . . , (6):) 

(2) [A,i,l,r, j] -^[B,i, I, h, k] [C, k,h,r, j] : 

Hence [B,i,l, h, k] =^Qr w' , [C, k, h,r,j] =^g' "w" where w = w'w" . We observe that, 
by induction hypothesis, B w' , C w" , and then A w. The proof can be 
easily completed by considering the following subcases: 

• w' = wi^i+i . . . wi^\^^\(3'wh,i . . . Wh,k, w" = Wh,k+i ■ ■ ■ Wh,\wh\/^"^^'rd ■ ■ ■ Wr.j , whcre 
P' € wl . . . w*j^ and /?' G wl^. . .w*. By choosing /3 = P'wiP" , it turns out that 

P ewl...W* and W = Wi^i+i . . . W^y^^lPWr,! ■ ■ ■ Wr,j. 

• l = h = r, i<k<j,w' = wi^i+i . . .wi^k, and w" = wi^k+i ■ ■ -wij. Then 

W = Wl^i+i . --Wlj. 

• I = h, i < k, w' = wi^i+i . . .wi^k, and w" = wi^k+i ■ ■ ■ wi^\^^\Pwr,i ■ ■ ■ Wrj, or 
h = r, k < j, w' = wi^i+i . . . Wi^\^^^\Pwr,i ■ ■ ■ Wr,k, and w" = w^^k+i ■ ■ ■ Wrj. In 
both these cases, wc got that w = . . . wi^^yj^^Pwr^i ■ ■ ■ Wrj- 

(3) [A,i,Lr,0] ^ [A,i,l,r,\wr\]: 

Hence [A, i, /, r, jitJ^ |] c w. By induction hypothesis we get that A ^g w. If 
w = wi^i+i . . . wi^\y,^\P'wr,i . . . Wr,\yj^\, with P' ^ w* . . . w*, then w = wi^i+i . . . wi^\yj^\P, 
with P = P'wr G Wi . . .w*. Otherwise, / = r and then w = wi^i^i . . . w^^yj^^P, with 
P = e. 

(4) [A,\wi\,l,r,j] ^ [A,0,l,r,j]: 

This case is similar to the previous one. 

(5) [A,i,l,r,0] [A,i,l,h,0]: 

Hence [^4, i, I, h, 0] =^ g' ^ and, by induction hypothesis, A w and w = li^j^j+i . . . w^^. 
for some P & Wi . . . w\. From h < r, \i turns out that P E Wi . . .w*. 
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(6) [A,\wi\,l,r,j] [A,\wh\,h,r,j]: 

Hence [A,\wh\,h,r,j] *=4>g" w and, by induction hypothesis, A w and w = 
fiwr,! ■ ■ ■ Wr,j for some 13 ^ w'f^. . .w*. Prom Z < r, it turns out that P E wf . . .w*. 



We now prove the if part. Even in this case the proof is by induction on the number s of 
the steps of the derivation A w under consideration. 

If s = 1 then = 1 and A ^ w must be a production in P. We have to consider two 
cases: 



• w = wi^i+i . . . WQyj^^Pwr,! . . . Wr,j, with P ew^ ...w*: 
Since l^l = 1, it turns out that: 

— either w = ^i'^,|^y;|, P = e, i = \wi \ ~ 1, and j = 0, or 

— w = Wr,i, (3 = e,i = \wi\, and j = 1, or 

— (3 = w = Wfi, for some I < h < r, i = \wi\, and j = 0. 

In the first case, the grammar G' should contain the production [A, \wi\—l, I, I, \wi\] — > 
w. Hence, using productions (5), (3), and (1), wc get: 

[A,i,l,r,j] = [A,\wi\-l,l,r,0] ^g' [A,\wi\ - 1,1,1,0] 

^G' [A,\wi\-'^,Ul,\wi\]^G'W- 

The second case is similar. In the third case, the grammar G' contains, by construc- 
tion, the production [A,0,h,h,l] — ^ w. Hence, using productions (6), (5), (3), (4), 
we get the following derivation: 

[A,i,l,r,j] = [A,\wi\,l,r,Q] =^g' [AAwh\,h,r,0] 

=^G' [A, \wh\,h,h, 0]^G' [A, \wh\,h,h, 1] 
^G' [A,0,h,h,l]^G'W. 



• l = r,i < j, and w = wi^i+i . . . wij: 
This case is trivial. 



Now suppose s > 1. The first production applied in the derivation of w should he A ^ BC, 
for two variables B and C, such that B ^g w' , C ^g w" , and w = w'w" . As before, the 
proof is divided into two cases: 



• w = wi^i+i . . . wi^i^^i(3wr,i ■ . . Wrj, with ^ E w* . . . w*: 
We have to consider the following three possibilities: 

- w' = Wi^i+i . . . WQ^^lf3'Wh,l . . . Wh,k, W" = Wh,k+1 ■ ■ ■ 'Wh,\wh.\P"'^r,l ■ ■ ■ WrJ, for 

some l<h<r, 0<k< \wh\, (3' G w*...wl, and (3" G wl...w*. By 
the inductive hypothesis, we obtain [B,i,l,h,k] ^g' w' , [G,k,h,r,j] ^g' w" . 
Hence, [A,i,l,r,j\ ^g' w. 
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- w' = wi^i+i . . . wi^k, w" = wi,k+i ■ ■ ■ wi,\wi\Pwr,i ■ ■ ■ Wr,j, for some i < k < Iwil, 
P G Wi ...w*. By the inductive hypothesis, we obtain [B,i,l,l,k] =^c' w', 
[C,k,l,r,j] 4>G" w". Hence, [A,i,l,r,j] ^q' w. 

- w' = wi^i+i . . . wi^i^jif3wr,i . ■ ■ Wr,k w" = Wr^k+i ■ ■ ■ Wj-j, for some < A; < j, 
P G Wi ...w*. By the inductive hypothesis, we obtain [B,i,l,r,k] w' , 
[C,k,r,r,j] 4>G' w". Hence, [A,i,l,r,j] w. 

• I = r, i < j, and w = . . . wij: 

There exists an index k, i < k < j, such that w' = wi^i^i . . . wi^k and w" = 
w = wi^k+i ■ ■ ■ wij. By the induction hypothesis, we obtain [B,i,l,l,k] w' 
and [C, ^g' w" . Hence, [A,i,l,l^j] w. 

□ 



Theorem 7 The context-free grammar G' is equivalent to G. Furthermore, for each string 

with Xk G S for k = 1, . . . ,n, there exists a derivation S' =^g' 
[Ai,ii - 1,^1, . . . [An.in - IjlnJnJn], for Suitable indices k, ...,/„, ji, . . . , j„, such 
that [AkJk - '^JkJkJk] Xk, fork = l,...,n. 

Proof: Given x (z wl . . -w^, as a consequence of the previous lemma, it is immediate 
to conclude that S ^g x if and only if S' ^g' x. Each symbol x^ of the string x will be 
generated in a derivation : S' ^g' 2; using a production of the form [^4^, jfc — 1, Zfc, 4, ifc] — > 
Xfc, for suitable Ai^^ji^^l^. Hence, from the derivation 6 it is easy to get a derivation 

e' : S' ^G' [AlJl - jl] . . . [An,jn - IJnJnJn]- □ 

At this point we are able to define the grammar G. The set of variables is the same as 
of G' , the set of production P contains all the productions in P' , with the exception of 
each production of the form [A,j — 1, 1, l,j] wij, which is replaced with the production 
[A,j — l,l,l,j] — > ai for j = 1, and with the production [A,j — ^ e in the 

other cases. It is immediate to prove that the language generated by G coincides with 

rHHG)). 

Hence, we get the following result: 

Corollary: Given m strings wi, . . . , Wm, m symbols oi, . . . , Um, and the homomorphism (j) 
associating with each symbol aj the string Wi, i = 1, . . . ,m, for each context-free grammar 
G generating a word bounded language L ^ . . -w^, there exists another context-free 
grammar G such that L{G) = ^-^{L{G)), and Symb(G) = 0(Symb(G)). 

We are now ready to extend Corollary [3] to word-bounded languages: 

Theorem 8 Given m strings wi, . . . , Wm, for any context-free grammar G with Symb(G) = 
h and generating a word-bounded language L C w^.-.w^, there exists an equivalent 
(m - l)-turn PDA M with 2<^('*) states and 0(1) stack symbols. 
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Proof: By Corollary [6l given m symbols oi, . . . , am, we can get a context-free grammar 
G with Symb(G') = 0(Symb(G)), which generates the language (j)~^{L{G)). Using Corol- 
lary El we are able to find an (m - f )-turn PDA M with 2'^(Symb{G)) g^^^^^g ^^^^ 
stack symbols, recognizing cj)^^{L{G)). 

From M we now define a PDA M accepting L: M should simulate M with the difference 
that a move of M consuming the input symbols a, is simulated by a sequence of moves 
of M consuming the input factor Wi. It is easy to implement this with 2^-^^^^ states and 
without increasing the number of stack symbols. □ 

Because the letter-bounded case is a special case of the word-bounded case, by the results 
presented in Section [U the upper bound presented in Theorem [8] is tight. 

Also Corollary H] can be extended to the word-bounded case. 

Theorem 9 The trade-offs between finite-turn pushdown automata that accept word-bounded 
languages are at most polynomial. 

Proof: Given m strings wi, . . . ,Wm, let M be some fc-turn PDA accepting a word- 
bounded language L C wlw2 ■ ■ ■ w^- We consider m letters ai, . . . , and the morphism </> 
associating with each aj the string Wi, i = 1, . . . , m. From the given PDA M, a /c-turn PDA 
M accepting (j)~^{L) can be defined by simulating M step by step, with the only difference 
that the input tape of M is replaced by a buffer of length max{|t(;j| \ i = 1, . . . , m}. When 
M has to simulate a move of M that reads an input symbol, it uses the next symbol from 
the buffer. However, if the buffer is empty, then M reads the next symbol aj from its 
input tape and puts the corresponding string Wi in the buffer. Note that the size of M is 
polynomial in the size of M. This is essentially the construction for inverse homomorphism 
as is given, e.g., in [8j. 

Now, using Theorem [31 from M it is possible to get an equivalent (m — l)-turn PDA M', 
which is still polynomial in the size of M. Finally, using the same construction outlined 
in the proof of Theorem [HI this last PDA can be converted in another (m — l)-turn PDA 
M' , accepting the original language L, whose size is polynomial in the size of M. □ 



7 Conclusion 

In this paper, we have considered context-free grammars generating letter-bounded as 
well as word-bounded languages. We have shown that such languages can be accepted by 
finite-turn pushdown automata. Furthermore, we have given constructions for converting 
context-free grammars which generate bounded languages to equivalent finite-turn push- 
down automata as well as minimizing the number of turns of a given pushdown automaton 
accepting a bounded language. The resulting trade-offs concerning the size of description 
of the corresponding context-free grammars and pushdown automata have been shown to 
be exponential when starting with an arbitrary context-free grammar or an arbitrary push- 
down automaton. When starting with a finite-turn pushdown automaton, a polynomial 
trade-off has been obtained. Both trade-offs are in strong contrast to the non-bounded 



29 



case where non-recursive trade-offs are known to exist. Moreover, the existence of non- 
recursive trade-offs implies that such conversion algorithms and minimization algorithms 
cannot exist in general. Additionally, equivalence or inclusion problems are undecidable 

for arbitrary context-free languages. 

We have shown that boundcdncss is a structural limitation on context-free languages 
which reduces non-recursive trade-offs to recursive trade-offs. Together with the known 
positively decidable questions such as equivalence or inclusion of bounded context-free 
languages, we obtain that context-free grammars and pushdown automata for bounded 
languages are much more manageable from a practical point of view than in the general 
case. 

References 

[1] L. Balke, K.H. Bohling, Einfiihrung in die Automatentheorie und Theorie formaler 
Sprachen, BI Wissenschaftsverlag, Mannheim, 1993. 

[2] J. Goldstine, M. Kappes, C.M.R. Kintala, H. Leung, A. Malcher, D. Wotschke, De- 
scriptional complexity of machines with limited resources, J.UCS 8 (2002) 193-234. 

[3] S. Ginsburg, The Mathematical Theory of Context-Free Languages, McGraw-Hill 
Book Co., New York, 1966. 

[4] S. Ginsburg, H. Rice, Two families of languages related to ALGOL, J. Assoc. Comput. 
Mach. 9 (1962) 350-371. 

[5] S. Ginsburg, E.H. Spanicr, Finite-turn pushdown automata, SIAM J. Control 4 (1966) 
429-453. 

[6] M.A. Harrison, Introduction to Formal Language Theory, Addison- Wesley, Reading 
MA, 1978. 

[7] M. Holzcr, M. Kutrib, Nondeterministic descriptional complexity of regular languages, 
Internat. J. Found. Comput. Sci. 14 (2003) 1087-1102. 

[8] J.E. Hopcroft, J.D. Ullman, Introduction to Automata Theory, Languages and Com- 
putation, Addison- Wesley, Reading MA, 1979. 

[9] O.H. Ibarra, T. Jiang, On one-way cellular arrays, SIAM J. Comput. 16 (1987) 1135- 
1154. 

[10] A. Kelemenova, Complexity of Normal Form Grammars, Theoret. Comput. Sci. 28 
(1984) 299-314. 

[11] M. Kutrib, The phenomenon of non-recursive trade-offs, Internat. J. Found. Comput. 
Sci. 16 (2005) 957-973. 

[12] A. Malcher, On recursive and non-recursive trade-offs between finite-turn pushdown 
automata, J. Autom. Lang. Comb., to appear. 



30 



[13] A.R. Meyer, M.J. Fischer, Economy of descriptions by automata, grammars, and 
formal systems, IEEE Symposium on Foundations of Computer Science, 1971, pp. 
188-191. 

[14] G. Pighizzini, J. Shallit, M.-W. Wang, Unary context-free grammars and pushdown 
automata, descriptional complexity and auxiliary space lower bounds, J. Comput. 
System Sci. 65 (2002) 393-414. 



31 



